CDS

Accession Number TCMCG075C04492
gbkey CDS
Protein Id XP_017981978.1
Location join(37141068..37141485,37142090..37142216,37142321..37142427,37142737..37142880,37143351..37143491,37143595..37143777,37144374..37144552,37144713..37144739)
Gene LOC18614660
GeneID 18614660
Organism Theobroma cacao
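
The Location field lists the eight exon segments that are spliced together to form the CDS. As a minimal sketch (the helper name `spliced_length` is hypothetical and not part of this record), the snippet below sums the segment lengths, assuming inclusive 1-based coordinates as in GenBank feature locations, and checks the result against the 441 aa protein length reported in this record.

```python
import re


def spliced_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a join(...) location.

    Assumes inclusive, 1-based coordinates (GenBank feature convention).
    """
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in segments)


# Location string copied from this record.
location = (
    "join(37141068..37141485,37142090..37142216,37142321..37142427,"
    "37142737..37142880,37143351..37143491,37143595..37143777,"
    "37144374..37144552,37144713..37144739)"
)

cds_nt = spliced_length(location)   # total coding nucleotides
codons = cds_nt // 3                # includes the terminal stop codon
print(cds_nt, codons - 1)           # 1326 441
```

The 1326 nt total corresponds to 442 codons; dropping the stop codon gives 441 residues, which agrees with the Length field.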

Protein

Length 441 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018126489.1
Definition PREDICTED: SURP and G-patch domain-containing protein 1-like protein isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category L
Description SURP and G-patch domain-containing protein 1-like
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03041
KEGG_ko ko:K13096
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGAAAGGAGTGCCATCTAGCCTTTTTGTTAATGATGGTTCCTTCATGGAGAGGTTTAAACAGCTTCAACAACAGAAGGATGAAAAAGACAAAGCTGCTGCTGCCTTAGAGGAATCTAAACCCCCCAAAATCGTTAAAGGGTCTTCAGCTCCCAAGCCTGCTATTGCTCTTAACAAAATTTCCATGGATTTTAAGCACAATGATGCACGCAAGACCTCCCAAACTTCTTCTGGGGGCAAACTTGCATTCAGCTTGAAACAGAAGTCAAAGCTTGTGGCACCTCCTGTTAAGTTGGCTGCAGACGAGGATGAAGAGGACCAAGATGCAGGAAAGTTGTCAGATGACACACCCGTAAAGCGGCAAAAGTTGTGTCAAGCAGATACCTCCGAACTAGCATCAAAACAAGTGGATGTTGCACTACCTTCCCCAAGTGATCCCAATGTGAAGAAAGTTGCAGACAAACTAGCAAGTTTTGTTGCCAAAAATGGAAGGCAGTTTGAGCATATTACACGGCAAAAAAACCCTGGAGACACACCTTTTAAATTCCTTTTTGATGAGAGCTGTTCTGATTACAAATACTATGAATTCCGGCTTGCTGAAGAGGAAAAAGCTCTTGTACAGAACAAGGAATCTCAAACTCCTCAAAGTGGTGGTATGAGCTTTTCAGCTACTAAGTCCACAAGCAGCTCCCTTAGGTCAGGTCTGCAGCAATCAAGTTATCAAATGCCTGCCTCTGCTTTGTATGAGAATAATGAGGAGCCTAGATCTTCTGCGATGTCAGCAGGAAGAGCAGTTGCAGGTTCATCCAGTGCTCCAACAGGTGCAGATCCTATAGCAATGATGGAGTTTTACATGAAGAAGGCTGCTCAGGAAGAGAAGATGAGACTGCCTAAGCAGTCCAAAGATGAGATGCCTCCACCTCCTTCCCTTCAAGGAGCTCCTTTGAAGAAAGGTCATCACATGGGTGATTATATCCCACCAGAAGAGCTTGAAAAGTTTTTGGCTGCCTGCAACGATGCTGCTGCTCAAAAAGCTGCACGGGAGACTGCAGAGAAGGCAAAGATTCAATCTGATAATGTTGGGCATAAACTTTTGTCAAAAATGGGTTGGAAAGAAGGTGAGGGTTTAGGGGGCTCCAGAAAGGGTATTTCAGATCCGATCATGGCTGGTGATGTAAAGATGAACAATTTGGGGGTTGGTGCTCATCATCCTGGAGATGTGACTGCAGAGGATGATATATATGAGCAGTATAAGAAACGGATGATGCTTGGTTATCGATACAGACCAAATCCTCTGAACAATCCTCGAAAGGCATACTATTGA
Protein:  
MEKGVPSSLFVNDGSFMERFKQLQQQKDEKDKAAAALEESKPPKIVKGSSAPKPAIALNKISMDFKHNDARKTSQTSSGGKLAFSLKQKSKLVAPPVKLAADEDEEDQDAGKLSDDTPVKRQKLCQADTSELASKQVDVALPSPSDPNVKKVADKLASFVAKNGRQFEHITRQKNPGDTPFKFLFDESCSDYKYYEFRLAEEEKALVQNKESQTPQSGGMSFSATKSTSSSLRSGLQQSSYQMPASALYENNEEPRSSAMSAGRAVAGSSSAPTGADPIAMMEFYMKKAAQEEKMRLPKQSKDEMPPPPSLQGAPLKKGHHMGDYIPPEELEKFLAACNDAAAQKAARETAEKAKIQSDNVGHKLLSKMGWKEGEGLGGSRKGISDPIMAGDVKMNNLGVGAHHPGDVTAEDDIYEQYKKRMMLGYRYRPNPLNNPRKAYY
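
The protein sequence above is the conceptual translation of the CDS. The following sketch shows how the two relate, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used, and the `translate` helper is hypothetical. It is applied only to the first and last few codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS in this record.
print(translate("ATGGAGAAAGGAGTGCCATCT"))  # MEKGVPS
print(translate("AAGGCATACTATTGA"))        # KAYY
```

Both fragments match the corresponding ends of the protein sequence: it begins with MEKGVPS and ends with KAYY, the final TGA codon being the stop.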